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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Glu Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp lie Thr 
15 10 15 

Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr 
20 25 30 

Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
35 40 45 

Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser 
50 55 60 

Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie Asp Lys Leu Gly 
65 70 75 80 

Lys He Val Asp Asp Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
85 90 95 

Asn Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro 
100 105 110 

Glu Glu Phe Phe Ser He Phe Asn Arg Ser He Asp Ala Phe Lys Asp 
115 120 125 

Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu 
130 135 140 

Gly Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu 
145 150 155 160 

Pro Pro Val Ala Ala 
165 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: modi f ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= Inosine 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ACRTTYTTNG GNGCRTTYTC YTCCAT 2 6 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 12 & 15 

(D) OTHER INFORMATION: /mod_base= Inosine 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
AARAAYTCYT CNGGNGTRAA RTT 23 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTYTCNGGYT TYTT 14 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGGARGARA AYGCCCCCAA RAAYGT 2 6 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCNAAYGAYT AYATGWTMAC 20 
(2) INFORMATION FOR SEQ ID NO : 7 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GGNGGNARCA TRAANGGYTT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
ACCAKAARAT CTTYAAANCG ATC 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GTATTTTCAA TAGATCCATT GA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
CCAACTATGT CGCC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GTAGTCAAGC TGACTGATAA G 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
TAACCAACAA TGACTAGGCA A 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
TTCCAGAGTC AGTGTC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GCGAAGCTTG CCTTTCCTTA TGAAGAAGA 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GCGCCGCGGT TACGGTGGTA ACATGAAGGG CTTTGTGA 
(2) INFORMATION FOR SEQ ID NO : 1 6 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 6 
GATAAATGCA AGTGATAATC C 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCGGTCGACC CGCGGAACTT TAAGTCCATG CAACAC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CACCCGCGGT TATGCAACAG GGGGTAACAT AAATGG 
(2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CACCCGCGGT TAGGCTGCAA CAGGGGGTAA CATAAA 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTTAATGTTG AAGAAACC 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GATGGTAGTA CAATTGTCAG AC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GTCTGACAAT TGTACTACCA TC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CAATTTAGTG ACGTCTTTTA CA 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TTAGATGAGT TTTCTTTCAC GCAC 
(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 




18 
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22 
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(A) LENGTH: 24 base pairs 

(B) TYPE:, nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
AAATCATTCA AGAGCCCAGA ACCC 
(2) INFORI^TION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
AACATCCATC CCGGGGAC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
CTGGCAATAT TTTAAGTCTC AAGAAGACC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GCGCCGCGGC TCCTATAGGT GCTAATTGG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
CCTCACCACT GTTTGTGCTG GATCGCA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGTGTCTAGA CTTGTGTCTT CTTCATAAGG A 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CCCCCCCHGG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TTTTTTTTTT TTTTTTTTGG 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TTTTTTTTTT TTTTTTTT.^\G 
(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 




27 



31 



10 



20 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TTTTTTTTTT TTTTTTTTCG 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 
TTCGGCCGAT CAGGCCCCCC CCCC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TTCGGCCGGA TAGGCCTTTT TTTTTTTTTT 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GGCCGGATAG GCCTCACNNN NNNT 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGCCGGATAG GCCTCAC 

(2) INFOR^4ATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin ( 660 . . 7 73 , 1184.. 1246, 2053.. 2223, 2837.. 2993, 

3692.-3774) 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: j oin { 720 . . 7 73 , 1184.. 1246, 2053.. 2223, 2837.. 2993, 



60 
12 0 
,180 
240 
300 
- 360 
420 
480 
540 
600 
659 
707 

755 



AAA GAC ATT ACA AAA CTG GTAAGTAAAG AATGATTTTG GCATCTATi\A 803 
Lys Asp lie Thr Lys Leu 
15 

GTCTTCCCTG TGCTTGCTGA CCACATAGGT TCAGGGCACT CCCGACAGGA GTTCCCAGCT 8 63 

TTCTAAGATA AGGAATCACT GTACGAGTCT GAAGTGCTTC TTCTGGGCAA ATGGGAGATG 923 





3692. 


.3774) 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 






AAAGTATCTT 


TCTATTGGCG 


AAGGACATGT 


TTTCCCATAA 


GTGGTAAACA 


AACTGTCTGC 


ACATAATAAT 


TATCTTGCTG 


CCGTAAAGAT 


TAGGTTAAAT 


TCTGCCTTCG 


ATCTAA/iAAC 


ACACCCTTCT 


GTCAATCCGA 


GGAGCAGTGT 


GCTAGTCTAG 


AGGTCTAAAT 


GAAGGCTCCT 


TTCACGGTTG 


TATTTCTGCT 


CCCCAAATTG 


TCCACATTTA 


AAAGGAGAGT 


GCTTCTTTTC 


AGCCTTAGGC 


TCTGAATTTC 


ATGCATTCCT 


CCATTTTCCG 


AGGTCCCCCC 


CAAGTGATAA 


TTCTGTTACA 


CGTTGCTACA 


AGTTCATCCC 


TAATTGCCGT 


CAAGAAACTG 


ACTGTAGT^G 


GCTTACCACA 


GACGTTGT7\A 


CCGACAGTAA 


AGCCATTGAA 


AGAGTAATTC 


AAACAGGATG 


GAAGCCAGGA 


GTATTTTGTG 


GCTGTTGCTC 


TTTTTCTTTT 


CAGTTTGGTG 


AGAGCAGCTT 


GAATGCTTAA 


CATTTAAGCC 


ATCAGCTTAA 


AACAAAACAA 


AACAAAACAA 


AAAAAAACCC 


CGCTCTGGCA 


TATTTGCACT 


TAACACATAC 


GGTATAAGGT 


GTTACTGGTT 


TGCATAGTTC 


TGGATTTTTT 


TTTTTTAAAA 


ACTGATGGAC 


ACCAAGTVT^AT 


GTTTCTGTTC 


TTTGTTTAG 


ACT TGG ATT ATC ACT TGC ATT TAT 
Thr Trp lie lie Thr Cys lie Tyr 
-20 -15 


CTT CAA CTG 
Leu Gin Leu 
-10 


CTC CTA TTT AAT CCT 
Leu Leu Phe Asn Pro 
-5 


CTC GTC AAA ACT CAG GAG ATC TGC 
Leu Val Lys Thr Gin Glu lie Cys 

1 


AGG AAT CCT 
Arg Asn Pro 
5 


GTG ACT GAT AAT GTA 
Val Thr Asp Asn Val 
10 
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CTTAGGTCAT GGAGGGTTTA TCTGTATAAC TGGCCCTTTG CACACCAACA AAGTGACTGA 98 3 

CTGGCTTTTG CCTGTTACCT ACTGTCTCCA GTCCTGGGCA TGGTATATAC TTAGGCACCC 104 3 

AAGATTGGAT TTACAACTCA AGCATTATAT ATTGGACAAC ACGGGGTATG AGATATTAAT 1103 

GATATGTCAG GTTGGATGGA TGAGTTTTCT CAAGAAATTC TCTTGTATTT ACTCACGTTT 1163 

TCATTTCTTG GTCTCTGTAG GTG GCG AAT CTT CCA AAT GAG TAT ATG ATA 1213 

Val Ala Asn Leu Pro Asn Asp Tyr Met lie 
20 25 



ACC CTC AAC TAT GTG GCC GGG ATG GAT GTT TTG GTATGTAGTC CACACACTTC 12 66 

Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
30 35 



TGAGTTGCCT 


TTTAGTAGCT 


AATGGGTGAC 


CTGTGCTTAT 


TCACATTGAA 


GACATTATTT 


1326 


GCTCTTTGTC 


GTTTTTAGAT 


GTTGACCTAT 


AATTTTTCCT 


TCAAGCTGCT 


GCTAAGATTA 


1386 


TCAGTGAGCA 


TTTCAGTATG 


TGTTTTAAGC 


CTACTCATTA 


AAAGGAAATG 


GCTCATCTTA 


1446 


GACGTAGCAA 


CCGATGTTAA 


TTTTTCCCCA 


GGCATCTCTC 


AGAGGGACTT 


GAATGTTAAA 


1506 


ATCATGTTAA 


ATTTCCTCCT 


TGGCTATGTT 


ATTTCTCATG 


GCTATGTTAT 


TCCTATTCGT 


1566 


AT TTG AT TTA 


AAGGGACGGA 


ATATTTATTG 


TATTTCTGAA 


CTTTTTCAGG 


CATGGATCCG 


162 6 


GGTCTTTGAA 


TAAAACACTA 


AGACTCCTTC 


TAGTAATGTT 


TGTAATCCTG 


TCTGTATCGA 


168 6 


ATGTCTTTGA 


AAACGCAGTG 


ACTAAGCCAT 


AAATAATCTT 


CCACAGAACG 


TCCAGTGGTT 


1,74 6 


CATGAACTTT 


GTATGTGGGG 


GTGGGGCAAG 


AATTGTCTCA 


CTATTGGTCA 


AGGAAGAGAA 


1806 


GGTAAGGTAT 


GCAAGGGTGG 


TTTAATCTTC 


TTCCGTGGAA 


GGACAAAATC 


ATCTATCATT 


1866 


TCCTCTGATC 


TCTATGCATT 


TGTTTGTTTT 


GAACTGTVATC 


TGACTTGAGC 


AAGAGTTGGC 


1926 


GTCCTGTGTT 


CTGAGGAAAC 


TCTTTGTCCT 


GCAGTCAGTG 


ACTAAAAGTG 


CTGAGAGATC 


.1986 


TGAAGAGCAC 


TCTGAATCTG 


CCATATTTTT 


AATAGATGCT 


TTGTCTTCTC 


TTTGAATTTC 


2046 


TTCCAG CCT 
Pro 


AGT CAT TGT TGG TTA CGA GAT ATG GTA ACA CAC 
Ser His Cys Trp Leu Arg Asp Met Val Thr His 


TTA TCA 
Leu Ser 


2094 



40 45 50 

GTC AGC TTG ACT ACT CTT CTG GAC J\AG TTT TCA AAT ATT TCT GAA GGC 214 2 

Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly 
55 60 65 

TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GGG AAA ATA GTG GAT GAC 2190 
Leu Ser Asn Tyr Ser lie lie Asp Lys Leu Gly Lys lie Val Asp Asp 
70 75 80 85 

CTC GTG GCA TGT ATG GAA GAA AAT GCA CCT AAG GTAACTTGGT ATTCATCAGA 22 4 3 
Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
90 95 

ATTATTTTTC TTATACTGAG CTCATGATGA GCAATTCACA ACCACTTGTA ATTCCAGCTC 2 303 

CAGAGGACAT TATCCCCTCT TTGGATGCCA TAGGAATCTG CTCTCAAATA TGTAGATACC 2363 

ACCTCTGCCA CCTCAGCACA TACATACACA TAATTAAAAA ATAGAAACAT TAAAGGAGTT 24 23 
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CCAATCAATC CTTATTCTTT 

TTTTAAAGGC TAATTCTTAT 

GGTCACTTTA TTGTCCTATT 

GTTTTGTTTA TTGTTCCCTA 

TCTCTGTAGC CATTTTGAGT 

AGGACATAGT CAATAATCTA 

AGAATCTTTG TTAATTATTT 



TCTGTATTCA GTATGCCCAG 
TTTGTAATAA GCAGCTTTAA 
GCCACGACAT TGTCCTGTCC 
GTTACTTTGA TCATGAGATT 
TGTGTCTATT AGAACAGCTG 
TTATGCTGAT CCAGTCAAGT 
GTTTGCTTGT TTGTTTGTTT 




ATGTAAATTC TAGGAATATG 

AATTCTTAAT TGTTTTTTCG 

CATTGTCTGT TATTCCTTCT 

GACCTGTTAC CCGTTGTTAT 

TTAAATTACT TGAATCATTG 

CTATGAGTTA TTTGAAAACT 

ATTATTTGTC TAG AAT 
Asn 



GTA AAA GAA TCA CTG AAG AAG CCA G7VA ACT AGA AAC TTT ACT CCT GAA 

Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu 
100. 105 110 

GAA TTC TTT AGT ATT TTC AAT AGA TCC ATT GAT GCC TTC AAG GAC TTC 

Glu Phe Phe Ser lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe 
115 120 125 

ATG GTG GCA TCT GAC ACT AGT GAT TGT GTG CTC TCT TCA ACA TTA GGT 

Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu Gly 

130 135 140 145 

CCT GAG AAA G GTAAGGCTTT TAAGCATTTC TTGTTTAAAT GTACATAGAA 
Pro Glu Lys 



AGCCTGAACT TCTGTAAGCC TCTACTGCTG 
CGTGTGGGTT TTTCTGATAA AAACAAAAAG 
CTAGCTTGAG AGATATGCCC TAAGACAGCG 
GTGATGGTGG CTCACATATT TTCTAACTGT 
TTCTTCATCC CCAGAATTCC TAAGATTTCA 
TCTGAGACTT GAGTTCTGAG CCTCAGCAGG 
GGTCCCTATG GGGGAACTAA CTTCATTGCT 
CTCCTCGCTA TTATAAACAT GTTCCTAAGC 
ACAAGACAGT TATGGTGTAT CGCTTGACAA 
TAATCTAGAC TTGGGAGGCA GACCCAGCAC 
GACTCTATGA TTCAAAAGTT CACCATGTAA 



AATCAACTAA ATGTGTTGCT GTAGAAAGAA 

CAAATATCAA TGACTACCAA TGATTATTAT 

ATTCTCGATA TTTCTAAATT AAAGAATTGT 

GATATTTGCC AGGAGAGTAG AATAATGTTA 

CGTCTCATGT CTTTTCCATA AGGTTCAAAC 

TCATTCTGAA TCCCCACTCT CCCCGAGCTG 

TTCTTTTAAA ACATGACGAG TTACCAACAG 

ATGTCTGTGC ATGCAATAAG CCTTCACTCT 

AACTGAGCAG CCAAGCTGAG TATGAAATAA 

CTACTGTGAT ATTGCACTTC GCCTTTGGGG 

CACTGACACA TTATTGCTTT CTATTTAG AT 

Asp 



TCC AGA GTC AGT GTC AGA AAA CCA TTT ATG TTA CCC CCT GTT GCA GCC 
Ser Arg Val Ser Val Arg Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
150 155 160 165 



AGT TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGTAAGTACA CATATCTGAT 
Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn 
170 175 



2483 
2543 
2603 
2663 
2723 
2783 
2839 

2887 

2935 

2983 

3033 

3093 
3153 
3213 
3273 
3333 
3393 
3453 
3513 
3573 
3633 
3693 

3741 

3794 
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TTACTGCATG CATGGCTCCA AGTATCCTCT ATAGGAGTGT TGCATGGACT TAAAGTTTAT 3854 

AAATCACTAC TAATAATGCT GTTCTGTCAC TGTTATTCCT TGTATGGGCT TCCTGAC7VAT 3914 

TAAATATCTG GTTTGTAGAA TCGGATCTCC TTAGAGGTTA AGATGACCAT GACAAAATTA 3 97 4 

GGCCAATCAA CTTTCTGCGA AGGTTATTTT AAATAAGGCA CGAAATTAAT TGAAGGAATU^ 4 034 

AAAAATACAA GCAAGGCCTT ATTTTGAATC ATGGTAGGCT TAAAATAGAC TTTGTGGAGA 4 094 

ATGTCCCTGA TCAT^lAGTGGA GTTTTCAGAT TTCAAGTGCA TGTGCTAACT CTCCACAATG 4154 

TCAAGGCTAT TTTCAGTTTT GTGTCTCCAT ATTTACTACT GCATGTTTGG AAATTTGCTG 4 214 

ATGCTGTTAG ATTACCTAAG AATGTATGTT GAAGAAGAAT GGACTTCTTT CCCTAAAATT 4 274 

TCTGTCCTCT TTGCCCAAGA ACCCACGTTC CTGGAAGACT ATCTTATTTT CATGTCTGTG 4 334 

CAATGATCAT TATAAAGATT ATTGAATATA CTGGGAATAC TCTGGTTTCT GTTTTTACAG 4 394 

ATTCATAATA GCTTATTCAG TCTTTAAAGA AAGTTCTCTG AAGTCCATGC TTTAGAATGT 4 4 54 

TTCTCTATCA AAACTTGACC TGGACCTTAA ATAAAGCTAT ATTTAGTCTT TTTATCCCTG 4 514 

AAAAATATAT TTCACAGTGT AGACATTTGA TATACATCTA AGGGAAGGAT GCTGCCAGAA 4 57 4 

TGCTCGGGCT GGCAGTCTAC AAAGTCCACT GCTGTCAGGA TGGACTTCTG AAAGCGGAAA .:4';634 

TTGTGAAGTG CATGCATATA ACATATCAGA TCCTCGAGC 4'67 3 
(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHAE^CTERISTICS : 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20' -15 -10 -5 



Leu Val Lys Thr Gin Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val 

15 10 



Lys Asp lie Thr Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie 
15 20 25 



Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp 
30 35 40 



Leu Arg Asp Met Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu 
45 50 55 60 



Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser Asn Tyr Ser He He 
65 70 75 



Asp Lys Leu Gly Lys He Val Asp Asp Leu Val Ala Cys Met Glu Glu 
80 85 90 
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Asn Ala Pro Lys Asn Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg 
95 100 105 

Asn Phe Thr Pro Glu Glu Phe Phe Ser lie Phe Asn Arg Ser lie Asp 
110 115 120 

Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu 
125 130 135 140 

Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg Val Ser Val Arg Lys 
145 150 155 

Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser 
160 165 170 

Ser Ser Ser Asn 
175' 

(2) INFORMATION FOR SEQ ID NO : 4 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 26.. 844 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 101.. 844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CTGGATCGCA GCGCTGCCTT TCCTT ATG AAG AAG ACA CAA ACT TGG ATT ATC 52 

Met Lys Lys Thr Gin Thr Trp lie lie 
-25 -20 

ACT TGC ATT TAT CTT CAA CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT 100 
Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys Thr 
-15 -10 -5 

CAG GAG ATC TGC AGG AAT CCT GTG ACT GAT AAT GTA AAA GAC ATT ACA 3.4 8 

Gin Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp lie Thr 
15 10 15 

AAA CTG GTG GCG AAT CTT CCA AAT GAC TAT ATG ATA ACC CTC AA.C TAT 196 
Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr 
20 25 30 

GTC GCC GGG ATG GAT GTT TTG CCT AGT CAT TGT TGG TTA CGA GAT ATG 24 4 

Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
35 40 45 

GTA ACA CAC TTA TCA GTC AGO TTG ACT ACT CTT CTG GAC AAG TTT TCA 2 92 

Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser 
50 55 60 
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AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAG AAA CTT GGG 34 0 

Asn He Ser Glu Gly Leu Ser Asn Tyr Ser He He Asp Lys Leu Gly 
65 70 75 80 

AAA ATA GTG GAT GAG CTC GTG GCA TGT ATG GAA GAA AAT GCA CCT AAG 38 8 

Lys lie Val Asp Asp Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
85 90 95 

AAT GTA AAA GAA TCA CTG AAG AAG CCA GAA ACT AGA AAC TTT ACT CCT 4 36 

Asn Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro 
100 105 110 

GAA GAA TTG TTT AGT ATT TTC AAT AGA TCC ATT GAT GCC TTC AAG GAC 484 
Glu Glu Phe Phe Ser He Phe Asn Arg Ser He Asp Ala Phe Lys Asp 
115 120 125 

TTC ATG GTG GCA TCT GAC ACT AGT GAT TGT GTG CTC TCT TCA ACA TTA 532 
Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu 
130 135 140 

GGT CCT GAG AAA GAT TCC AGA GTG AGT GTC ACA AAA CCA TTT ATG TTA 58 0 

Gly Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu 
145 150 155 160 

CCC CCT GTT GCA GCC AGT TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT 62 8 

Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn 
165 170 175 

AGG AAA GCC GCA AAG TCC CCT GAA GAC CCA GGC CTA CAA TGG ACA GCA ^67 6 

Arg Lys Ala Ala Lys Ser Pro Glu Asp Pro Gly Leu Gin Trp Thr Ala 
180 185 190 

ATG GCA CTG CCG GCT CTC ATT TCG CTT GTA ATT GGC TTT GCT TTT GGA 72 4 

Met Ala Leu Pro Ala Leu He Ser Leu Val He Gly Phe Ala Phe Gly 
195 200 205 

GCC TTA TAG TGG AAG AAG AAA CAG TCA AGT CTT ACA AGG GCA GTT GAA -772 
Ala Leu Tyr Trp Lys Lys Lys Gin Ser Ser Leu Thr Arg Ala Val Glu 
210 215 220 

AAT ATA CAG ATT AAT GAA GAG GAT AAT GAG ATA AGT ATG TTG CAA CAG 82 0 

Asn He Gin He Asn Glu Glu Asp Asn Glu He Ser Met Leu Gin Gin 
225 230 235 240 

AAA GAG AGA GAG TTT CAA GAG GTG TAATT 84 9 

Lys Glu Arg Glu Phe Gin Glu Val 
245 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Lys Thr Gin Thr Trp He He Thr Cys He Tyr Leu Gin Leu 
-25 -20 -15 -10 
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Leu Leu Phe Asn Pro Leu Val Lys Thr Gin Glu He Cys Arg Asn Pro 
-5 15 

Val Thr Asp Asn Val Lys Asp He Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Asn Asp Tyr Met He Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp Leu Arg Asp Met Val Thr His Leu Ser Val Ser 
40 45 50 55 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Gly Lys He Val Asp Asp Leu Val 
75 80 85 

Ala Cys Met Glu Glu Asn Ala Pro Lys Asn Val Lys Glu Ser Leu Lys 
90 95 100 

Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe 
105 110 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
120 125 130 135 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ser Pro 
170 175 180 

Glu Asp Pro Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu He 
185 190 195 

Ser Leu Val He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
200 205 210 215 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
220 225 230 

Asp Asn Glu He Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
235 240 245 

Val 

(2) INFORMATION FOR SEQ ID NO : 4 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin ( 90 5 . . 1 0 1 8 , 1914.. 1976, 2572.. 2742, 3152 
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. . 3307, 3513. . 3595) 

(ix) FEATURE: 

(A) NAME/KEY: inat_peptide 

(B) LOCATION: j oin ( 965 . . 10 1 8 , 1914.. 1976, 2572.. 2742, 3152 

. . 3307, 3513, . 3595) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

CACAAGTGAG TAGGGCGCGC CCGGGAGCTC CCAGGCTCTC CAGGAAAAAT CGCGCCCGGT 60 

GCCCCGGGGA AGCCGGCGCT CCCTGGGACT TGCAGCTGGG GCGTGCAGGG CTGTGCCTGC 120 

CGGGTGAGAT ACTACAAAGA TAAATCAGTT GCACAAGTTC TTGAAACTCT ACAGTGTAAT 180 

AAGGAAAAAT AAGTCATGCA TAAAAGCAAC TATAATACAT AATAGT^AAAT GTTATTTTCA 240 

AGCCGATGTG TAGGTTATGT GTGTTCGAGA GAGAGAGAGA GAAGACAGAT TACTTTCTGC 300 

TAGGGTTCAA GAATGCCTTC CTGTTGGCTA AGGAAATATT TTCCTTAAGT GGCTAAAAAG 3 60 

CTGTGTTTCA AAATATTCTT TTGATGTCTC ACAAATTCAG TGGAATTCTC TTAGGTCTAA 4 20 

AAATATACAT CTCTCTCACT TTAACTTGGT GTGCTATTGT AGATTATTGG ATTAAAGCAC 4 80 

TGCTCAGGGA TTATGCTGCT TCTTGCCAAG CAGTCTACAT TTAAAGTAGA AATAAGATGT 54 0 

TTCTTTTGGT GCCATAAGGT ATACATTTTA TGCATTCTCT AGTTTTTAGA AGATACCCTA 600 

AGGGCTAAGT CTTTAACATG CTGCTACAAG TTTATTCCTA ATTGCCATTG GGAAATTGGC 660 

TGAAGAAAGT TTTTAACAAA AGTTAACAAT ATTGTCATTG AGAGAATAAT TCAAAATGGA 720 

TTTTAACTAA AAGCTTTTAA AAACTTTGGT GAGCATAGCT TGAATGCGTA ATATTTAATT '780 

GCATTTAAGC CAATAACATA TATTAGACTG GTCTTTTTGT GCATCAAGGC ATTAGATGTT ,840 

AAAAGTTTGA ATGATTACAG ATCTTAACTG ATGATCACCA AGCAATTTTT CTGTTTTCAT r '^900 

TTAG ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG CTG CTC CTA TTT AAT 94 9 
Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn 
-20 -15 -10 

CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT CGT GTG ACT AAT AAT 997 
Pro Leu Val Lys Thr Giu Gly lie Cys Arg Asn Arg Val Thr Asn Asn 
-5 15 10 

GTA AAA GAC GTC ACT AJ\A TTG GTAAGTAAGG AATGCTTTAC CGTGCTGTGT 104 8 
Val Lys Asp Val Thr Lys Leu 
15 

AAAAAAGAGC TGTGGCTCTT TTTCCTGTGC TTGTTGATAA AAGATTTAGA TTTTTCTTGC 1108 

CCCAAAGTAA TGTTTTCCTA AAGTGGGGAA AGTAATCACT GGGTTACAAT AAAGGGTTTA 1168 

TAGAAAGCAG GTAGTGAGAT ATTTAGGGTC ATGGATAATT TGTTGGTAAA ACTGGCTAGT 1228 

TGCACACCAC TGCTGTGACT GCTTCTTTGC TGGTCTTCTC CCCATCCTTC ATAGGCAGTG 12 8 8 

AAGGACCTTG GAGAGTTCGC TGTGTGCTGA TGGGCTTGCC CCAGCTTGTT CCCCATAATC 134 8 

TCTCCAGTGG GTTTCCCAGC ATGTTCTATT CCCCTTCACA TGTCTTCCTA CTCTTCTTTA 1408 
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AAAAGCCTAA CGAAAGGAAA TCTGAAATGG CTATTCTCCC AATTCT^TCA GCAGGAAGAC 14 68 

CCTGTCACAT GTCAGTGGGT GTTTGCTCCT TCAGGGAACA TAGAGAGGTG ATTCATTGCC 1528 

CACATGTTGA AGGGACTCAT CTCCCTGGTT TGTCACATTG AACTCTTCCC TCAGCGAAAG 158 8 

CATTTGCATT GCTTCCCGAA TTCCAAGATC ACAGGTGGAA GCTGAAATTC AGATCATGTT 164 8 

TCCAAAACTC AGTAGGTTAT "ACCTAGCCAG GCATAACTGA ATTTGGAGTC TAAAAGATCT 1708 

GTATTATCAC TTTTTTATTT TGAAGGATGC CTTTTGATTA CAGAGGGAAA TCAAGGATTA 17 68 

AAAATCAATA TACATGTAAA TATTGAAATT CATTGGTAAC TTTAAAAAGC ACAACAGTTT 1828 

TGTGTGCTTT TCTCCAAAGC ACTACAAATA TGATTAATTG ATGTATAAGA ATTTTCTTAT 188 8 

GGAATTTTTT TTTTTGTCTC TGTAG GTG GCA AAT CTT CCA AAA GAC TAC ATG 194 0 

Val Ala Asn Leu Pro Lys Asp Tyr Met 
20 25 

ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT TTG GTATGTAAAC 198 6 
lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
30 35 

TACATTTCTG AGTTTCATTT TAGTAGCTCA TAGAAGAAAT GGGATCATTC ATATTGAGAT 204 6 

AGTACACTAG CTGCTATTTA GGAGCTTGCT TATTGTCAGG ATTTG7U\GAA TTTATCTTTG .210 6 

GAATTTGACT TGCAGGCTTT TTTTTCCCCC TCTTCCTGTT ACAAGAGTCC CTCCTCCTAT 2166 

TACAATAGTC CCTCCTCCTC CTGTCACACT AGTCCCTTCT CTTCCTGTTA CAATAACCCC 2226 

TGTCCTCCTA TTACAACATT TTAAGTAATG TAATATTAAT TTTAAAAATC TGGCCAGGCA 228 6 

CGGTGGTTCA TGCTTGTAAT CCCAGCACAT TGGGAAGCTG AGACGGGTGG ATCATTTGAG 234 6 

GTCAGGAAGT TTGAGACAGC CTGGCCAACA TGGTGAAACT TCCTCTCTAC TAAAAATAAA '24 06 

AAAGTAGCCA GGCATGGTGG CAGGCACTTG TAATCTGAGC TACTCGAGAG GCTGAGGCAG 24 66 

GAGAATCACT TGAGTAACTA AAACGATAGC TTTGAAGAGT ACTCCGAGTT TTATGGCACT 2526 

TACTTATTAA AATAGCTGTT TTGTCTCTTT TTTCATATCT TGCAG CCA AGT CAT 258 0 

Pro Ser His 
40 

TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC AGC TTG ACT GAT 2 628 
Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp 
45 50 55 

CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC 2 67 6 
Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser 
60 65 70 

ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT GTG GAG TGC GTG 2724 
lie lie Asp Lys Leu Val Asn lie Val A„sp Asp Leu Val Glu Cys Val 
75 80 85 90 

AAA GAA AAC TCA TCT AAG GTAACTTTGT GTTCATTGGG ATTATTTTTC 2772 
Lys Glu Asn Ser Ser Lys 
95 

ATTACGCTTC TCTAAAAACC CATGCTTCTT GGTGCTGTTG GGGAAAATGA GGCACCTTTA 2832 
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TTTATGATAT TTTGATTGTA TAAACTTCAA ATTTAAAAAT CTTGTTCAGA TGAGCAAAGA 28 92 

AAACAAGTAT TTGCAGTTAT ACTGCAATAC TGAAGTGCAC ATTCTTGTGT TCACTGCCCC 2 952 

AGATTCAACT TGTGATCCCA CTGGGATCAC TACCCTGCAT TACCAATCTG AATTACATAC 3012 

GTTAAAACAG CCATCTAAAA GTGCTAGTTG TAAGAGTCTA AATACTTGAA TCTTTGAGAG 3072 

ACATATTTAT AGTCCATTAT CTTCACCTCA GTTAAGTCTG AAGACTATTT GAAAAATGTA 3132 

ATCCTATTTT TTCTTCTAG GAT CTA AAA AAA TCA TTC AAG AGO CCA GAA CCC 318 4 

Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu Pro 
100 105 

AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC ATT 3232 
Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie 
110 115 120 

GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT GTG 3280 
Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val 
125 130 135 

GTT TCT TCA ACA TTA AGT CCT GAG AAA GGTAAGACAT GTAAGCATTT 3327 
Val Ser Ser Thr Leu Ser Pro Glu Lys 
140 145 

CCAGTTCAAA TGTAAACAAC AAACTTAAAT CTTCCCTATG TAGTAAGAAT CTACCTCTGT 338 7 

GTTAAGCTGT AGCAAGATAC ATGCATGTAC GTCTAATAAA AAAGCAGATA TCAATAGCAC 34 4 7 

AGAAGAAACT CTATAACTCA TACAAATCAC CATATAACAC TGACACATTA TTGCTTTCTA 3507 

TTTAG ATT CCA GAG TCA GTG TCA CAA AAC CAT TTA TGT TAG CCC CTG 3554 
lie Pro Glu Ser Val Ser Gin Asn His Leu Cys Tyr Pro Leu 
150 155 160 

TTG CAG CCA GCT CCC TTA GGA ATG ACA GCA GTA GCA GTA ATAGTAAGTA 3603 
Leu Gin Pro Ala Pro Leu Gly Met Thr Ala Val Ala Val 
165 170 175 

CATATATCTG ATTTAATGCA TGCATGGCTC CAATTAGCAC CTATAGGAGT ATTGCATGGG 3663 

CTTTCAAGGA AACTTCTACA TTTATTATTA TTGATACTGT TCTGTTACTG TTATTCCTTT 3723 

TATGGTCTTC TTGAGACTTA AGTTTGTAGA ATTAAATTTC CCTAGAGCTG GAGATAATGT 37 8 3 

TTAGAGAATT AGGCCAATAA ATTT 3807 

(2) INFORMATION FOR SEQ ID NO : 4 4 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20 -15 -10 -5 
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Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg Val Thr Asn Asn Val 

15 10 

Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro Lys Asp Tyr Met lie 
15 20 25 

Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser His Cys Trp 
30 35 40 

lie Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu 
45 50 55 60 

Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie 
65 70 75 

Asp Lys Leu Val Asn lie Val Asp Asp Leu Val Glu Cys Val Lys Glu 
80 85 90 

Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg 
95 100 105 

Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp 
110 115 120 

Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val Val 
125 130 135 140 

Ser Ser Thr Leu Ser Pro Glu Lys lie Pro Glu Ser Val Ser Gin Asn 
145 150 155 

His Leu Cys Tyr Pro Leu Leu Gin Pro Ala Pro Leu Gly Met Thr Ala 
160 165 170 

Val Ala Val 
175 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17. , 640 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 92.. 640 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

AAGCTTGCCT TTCCTT ATG AAG AAG ACA CAA ACT TGG ATT CTC ACT TGC 4 9 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys 
-25 -20 -15 

ATT TAT CTT CAG CTG CTC CTA TTT AAT OCT CTC GTC AAA ACT GAA GGG 97 
lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly 
-10 -5 1 
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ATC TGC AGG AAT CGT GTG ACT AAT AAT GTA AAA GAG GTC ACT AAA TTG 14 5 

He Cys Arg Asn Arg Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu 
5 10 15 

GTG GCA AAT CTT CCA AAA GAC TAG ATG ATA ACC CTC AAA TAT GTC CCC 193 
Val Ala Asn Leu Pro Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro 
20 25 30 

GGG ATG GAT GTT TTG CCA AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA 241 
Gly Met Asp Val Leu Pro Ser His Cys Trp He Ser Glu Met Val Val 
35 40 45 50 

CAA TTG TCA GAC AGC TTG ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT 28 9 

Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He 
55 60 65 

TCT GAA GGG TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA 337 
Ser Glu Gly Leu Ser Asn Tyr Ser He He Asp Lys Leu Val Asn He 
70 75 80 

GTG GAT GAC CTT GTG GAG TGC GTG AAA GAA AAC TCA TCT AAG GAT GTA 385 
Val Asp Asp Leu Val Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu 
85 90 95 

AAA AAA TCA TTC AAG AGC CCA GAA CCC AGG CTC TTT ACT CCT GAA GAA 4 33 

Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu 
100 105 110 

TTC TTT AGA ATT TTT AAT AGA TCC ATT GAT GCC TTC AAG GAC TTT GTA 4 81 

Phe Phe Arg He Phe Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val 

120 125 130 

GTG GCA TCT GAA ACT AGT GAT TGT GTG GTT TCT TCA ACA TTA AGT CCT 52 9 

Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro 
135 140 145 

GAG AAA GAT TCC AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA CCC CCT 577 
Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro 
150 155 160 

GTT GCA GCC AGC TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGT AAG 625 
Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser Lys 
165 170 175 

TAC ATA TAT CTG ATT TAATGCATGC ATGGCTCCAA TTAGCACCTA TAGGAGTATT 680 
Tyr He Tyr Leu He 
180 

GCATGGGCTT TCAAGGAAAC TTCTACATTT ATTATTATTG ATACTGTTCT GTTACTGTTA 74 0 

TTCCTTTTAT GGTCTTCTTG AGACTTAAGT TTGTAGAATT AAATTTCCCT AGAGCTGGAG 800 
ATAATGTTTA GAGAATTAGG 320 
(2) INFORMATION FOR SEQ ID NO : 4 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 6 : 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
-5 15 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
105 110 115 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser Lys Tyr lie Tyr Leu lie 
170 175 180 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin ( 565 . . 57 9, 1684.. 1797, 2693.. 2755, 3351.. 3521, 

3932. .4088, 4314. .4397, 4778.-4887, 5208. .5275, 
5677 . .5713) 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: j oin ( 17 4 4 . . 17 97 , 2693.-2755, 3351.. 3521, 3932 

..4088, 4314. .4397, 4778. .4887, 5208. .5275, 5677 
. . 5713) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 : 



GAGCTCCGAG 


CCCTCTCTGG 


CGCGCGAGGT 


ATTTCGTCTG 


TNCCCGGGGG 


TGCCAGGTGA 


60 


GCCCCAGCGG 


ATCCGGGAGG 


GTAAGCTGGG 


ACTCCTCGCG 


AGCAGTAGCT 


GCAGGGTACC 


120 


AAGCTTCGCC 


CTCTGCGTCC 


CCGCGCCTTC 


GCGGTCTCCC 


GCCAGTGCAG 


GTCCGGGGCC 


180 


CCCAGGCGAG 


CGGACAAGGT 


TGGCCTAATC 


TGCCAAACTT 


CTGGGGCATT 


TACCGTGCTC 


240 


TGGCCGCCCT 


CCCGATTCTT 


CCCTCCGCGC 


CCTTGCCTGC 


TTCTCGCCTA 


CCCCGGGCTC 


300 


CGGAAGGGAA 


GGAGGCGTGT 


CCGGAGCAGG 


CGGGCGGGAA 


CTGTATAAAA 


GCGCCGGCGG 


360 


CTCAGCAGCC 


GGCTTCGCTC 


GCCGCCTCGC 


GCCGAGACTA 


GAAGCGCTGC 


GGGAAGCAGG 


420 


GACAGTGGAG 


AGGGCGCTGC 


GCTCGGGCTA 


CCCAATGCGT 


GGACTATCTG 


CCGCCGCTGT 


480 


TCGTGCAATA 


TGCTGGAGCT 


CCAGAACAGC 


TAAACGGAGT 


CGCCACACCA 


CTGTTTGTGC 


540 


TGGATCGCAC 


CGCTGCCTTT 


CCTT ATG AAG AAG ACA CAA GTGAGTAGGG 
Met Lys Lys Thr Gin 
-25 


589 


CGCGCCCGGG 


AGCTCCCAGG 








(a G Cj AA G C C G 


649 


GCGCTCCCTG 


GGACTTGCAG 








TV 7\ TV TV TV 

GAGACAAGAG 


709 


GATGCGGGGG 


AGGCCGGCGT 




^ O L- vjj/A.tj U 




C LAG G G Atd AA 


769 


AAGGAGTGGG 


AGTACTGAGA 


GGGAGCCAGT 


GTCAAGTTTG 


GAGCCTCAGC 


AGTTAAGTTT 


829 


TGAGCTGTCA 


GTCGGAAACC 


GTAATTCCCG 


TCTGGTGGAA 


AGATTGGCTT 


TTNGNCCACG 


889 


GAATGTAAGT 


TATCACAGAT 


ACTACAAAGA 


TAAATCAGTT 


GCACAAGTTC 


TTGAAACTCT 


949 


ACAGTGTAAT 


AAGGAAAAAT 


AAGTCATGCA 


TAAAAGCAAC 


TATAATACAT 


AATAGAAAAT 


1009. 


GTTATTTTCA 


AGCCGATGTG 


TAGGTTATGT 


GTGTTCGAGA 


GAGAGAGAGA 


GAAGACAGAT 


1069 


TACTTTCTGC 


TAGGGTTCAA 


GAATGCCTTC 


CTGTTGGCTA 


AGGAAATATT 


TTCCTTAAGT 


1129 


GGCTAAAAAG 


CTGTGTTTCA 


AAATATTCTT 


TTGATGTCTC 


ACAAATTCAG 


TGGAATTCTC 


1189 


TTAGGTCTAA 


AAATATACAT 


CTCTCTCACT 


TTAACTTGGT 


GTGCTATTGT 


AGATTATTGG 


1249 


ATTAAAGCAC 


TGCTCAGGGA 


TTATGCTGCT 


TCTTGCCAAG 


CAGTCTACAT 


TTAAAGTAGA 


1309 


AATAAGATGT 


TTCTTTTGGT 


GCCATAAGGT 


ATACATTTTA 


TGCATTCTCT 


AGTTTTTAGA 


1369 


AGATACCCTA 


AGGGCTAAGT 


CTTTAACATG 


CTGCTACAAG 


TTTATTCCTA 


ATTGCCATTG 


1429 


GGAAATTGGC 


TGAAGAAAGT 


TTTTAACAAA 


AGTTAACAAT 


ATTGTCATTG 


AGAGAATAAT 


1489 


TCAAAATGGA 


TTTTAACTAA 


AAGCTTTTAA 


AAACTTTGGT 


GAGCATAGCT 


TGAATGCGTA 


1549 


ATATTTAATT 


GCATTTAAGC 


CAATAACATA 


TATTAGACTG 


GTCTTTTTGT 


GCATCAAGGC 


1609 


ATTAGATGTT 


AAAAGTTTGA 


ATGATTACAG 


ATCTTAACTG 


ATGATCACCA 


AGCAATTTTT 


1669 



CTGTTTTCAT TTAG ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG CTG CTC 1719 

Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu Leu 
-20 -15 -10 
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CTA TTT AAT CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT CGT GTG 17 67 

Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg Val 
-5 15 

ACT AAT AAT GTA AAA GAC GTC ACT AAA TTG GTAAGTAAGG AATGCTTTAC 1817 
Thr Asn Asn Val' Lys Asp Val Thr Lys Leu 
10 15 



CGTGCTGTGT 


AAAAAAGAGC 


TGTGGCTCTT 


TTTCCTGTGC 


•TTGTTGATAA 


AAGATTTAGA 


1877 


TTTTTCTTGC 


CCCAAAGTAA 


TGTTTTCCTA 


AAGTGGGGAA 


AGTAATCACT 


GGGTTACAAT 


1937 


AAAGGGTTTA 


TAGAAAGCAG 


GTAGTGAGAT 


ATTTAGGGTC 


ATGGATAATT 


TGTTGGTAAA 


1997 


ACTGGCTAGT 


TGCACACCAC 


TGCTGTGACT 


GCTTCTTTGC 


TGGTCTTCTC 


CCCATCCTTC 


2057 


ATAGGCAGTG 


AAGGACCTTG 


GAGAGTTCGC 


TGTGTGCTGA 


TGGGCTTGCC 


CCAGCTTGTT 


2117 


CCCCATAATC 


TCTCCAGTGG 


GTTTCCCAGC 


ATGTTCTATT 


CCCCTTCACA 


TGTCTTCCTA 


2177 


CTCTTCTTTA 


AAAAGCCTAA 


CGAAAGGAAA 


TCTGAAATGG 


CTATTCTCCC 


AATTCAATCA 


2237 


GCAGGAAGAC 


CCTGTCACAT 


GTCAGTGGGT 


GTTTGCTCCT 


TCAGGGAACA 


TAGAGAGGTG 


2297 


ATTCATTGCC 


CACATGTTGA 


AGGGACTCAT 


CTCCCTGGTT 


TGTCACATTG 


AACTCTTCCC 


2357 


TCAGCGAAAG 


CATTTGCATT 


GCTTCCCGAA 


TTCCAAGATC 


ACAGGTGGAA 


GCTGAAATTC 


2417 


AGATCATGTT 


TCCAAAACTC 


AGTAGGTTAT 


ACCTAGCCAG 


GCATAACTGA 


ATTTGGAGTC 


2477 


TAAAAGATCT 


GTATTATCAC 


TTTTTTATTT 


TGAAGGATGC 


CTTTTGATTA 


CAGAGGGAAA 


2537 


TCAAGGATTA 


AAAATCAATA 


TACATGTAAA 


TATTGAAATT 


CATTGGTAAC 


TTTAAAAAGC 


2597 


ACAACAGTTT 


TGTGTGCTTT 


TCTCCAAAGC 


ACTACAAATA 


TGATTAATTG 


ATGTATAAGA 


2657 


ATTTTCTTAT 


GGAATTTTTT 


TTTTTGTCTC 


TGTAG GTG GCA AAT CTT 


CCA AAA 


2710 



Val Ala Asn Leu Pro Lys 
20 



GAC TAC ATG ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT TTG 
Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 . 



GTATGTAAAC 


TACATTTCTG 


AGTTTCATTT 


TAGTAGCTCA 


TAGAAGAAAT 


GGGATCATTC 


2815 


ATATTGAGAT 


AGTACACTAG 


CTGCTATTTA 


GGAGCTTGCT 


TATTGTCAGG 


ATTTGAAGAA 


2875 


TTTATCTTTG 


GAATTTGACT 


TGCAGGCTTT 


TTTTTCCCCC 


TCTTCCTGTT 


ACAAGAGTCC 


2935 


CTCCTCCTAT 


TACAATAGTC 


CCTCCTCCTC 


CTGTCACACT 


AGTCCCTTCT 


CTTCCTGTTA 


2995 


CAATAACCCC 


TGTCCTCCTA 


TTACAACATT 


TTAAGTAATG 


TAATATTAAT 


TTTAAAAATC 


3055 


TGGCCAGGCA 


CGGTGGTTCA 


TGCTTGTAAT 


CCCAGCACAT 


TGGGAAGCTG 


AGACGGGTGG 


3115 


ATCATTTGAG 


GTCAGGAAGT 


TTGAGACAGC 


CTGGCCAACA 


TGGTGAAACT 


TCCTCTCTAC 


3175 


TAAAAATAAA 


AAAGTAGCCA 


GGCATGGTGG 


CAGGCACTTG 


TAATCTGAGC 


TACTCGAGAG 


3235 


GCTGAGGCAG 


GAGAATCACT 


TGAGTAACTA 


AAACGATAGC 


TTTGAAGAGT 


ACTCCGAGTT 


3295 


TTATGGCACT 


TACT TAT TAA 


AATAGCTGTT 


TTGTCTCTTT 


TTTCATATCT 


TGCAG CCA 


3353 
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Pro 
40 

AGT CAT TGT TGG ATA AGO GAG ATG GTA GTA CAA TTG TCA GAG AGO TTG 34 01 

Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu 
45 50 55 

ACT GAT CTT CTG GAG AAG TTT TCA AAT ATT TCT GAA GGC TTG AGT AAT 34 4 9 

Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Giy Leu Ser Asn 
60 65 70 

TAT TCC ATC ATA GAG AAA CTT GTG AAT ATA GTG GAT GAC CTT GTG GAG 34 97 

Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val Glu 
75 80 85 

TGC GTG AAA GAA AAC TCA TCT AAG GTAACTTTGT GTTCATTGGG ATTATTTTTC 3551 
Cys Val Lys Glu Asn Ser Ser Lys 
90 95 

ATTACGCTTC TCTAAAAACC CATGCTTCTT GGTGCTGTTG GGGAAAATGA GGCACCTTTA 3611 

TTTATGATAT TTTGATTGTA TAAACTTCAA ATTTAAAAAT CTTGTTCAGA TGAGCAAAGA 3671 

AAACAAGTAT TTGCAGTTAT ACTGCAATAC TGAAGTGCAC TATTCTTGTG TTCACTGCCC 3731 

CAGATTCAAC TTGTGATCCC ACTGGGATCA CTACCCTGCA TTACCAATCT GAATTACATA 37 91 

CGTTAAAACA GCCATCTAAA AGTGCTAGTT GTAAGAGTCT AAATACTTGA ATCTTTGAGA 38 51 

GACATATTTA TAGTCCATTA TCTTCACCTC AGTTAAGTCT GAAGACTATT TGAAA^^ATGT 3911 

AATCCTATTT TTTCTTCTAG GAT CTA AAA AAA TCA TTC AAG AGC CCA GAA 3961 

Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu 
100 105 

CCC AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT TTT .AAT AGA TCC 4 009 

Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe Asn Arq Ser 
110 115 120 

ATT GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT 4 057 

He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp r^vs 
125 130 - 135 

GTG GTT TCT TCA ACA TTA AGT CCT GAG AAA G GTAAGACATG TAAGCATTTC 4108 
Val Val Ser Ser Thr Leu Ser Pro Glu Lys 
140 145 

CAGTTCAAAT GTAAACAACA AACTTAAATC TTCCCTATGT AGTAAGAATC TACCTCTGTG 4168 

TTAAGCTGTA GCAAGATACA TGCATGTACG TCTAAAAAAA AGCAGATATC AATAGCACAG 4 228 

AAGAAACTAA TGATTGTAGA TTTGTGGGCT CTATAACTCA TACAAATCAC CATATAACAC 4 28 8 

TGACACATTA TTGCTTTCTA TTTAG AT TCC AGA GTC AGT GTC ACA AAA CCA 4 33 9 

Asp Ser Arg Val Ser Val Thr Lys Pro 
150 155 

TTT ATG TTA CCC CCT GTT GCA GCC AGC TCC CTT AGG AAT GAC AGC AGT 4 387 

Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser 
160 165 170 
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AGC AGT AAT A GTAAGTACAT ATATCTGATT TAATGCATGC ATGGCTCCAA 4 4 37 
Ser Ser Asn 
175 

TTAGCACCTA TAGGAGTATT GCATGGGCTT TCAAGGAAAC TTCTACATTT ATTATTATTG 4 4 97 

ATACTGTTCT GTTACTGTTA TTCCTTTTAT GGTCTTCTTG AGACTTAAGT TTGTAGAATT 4 5 57 

AAATTTCCCT AGAGCTGGAG ATAATGTTTA GAGAATTAGG CCAATAAATT TTCTGCTGAG 4 617 

GTTATTTTAA ATAAGACATA AAATTAATTT TAGAAATATG ATTTATGCCT TTTGTTGAAT 4 677 

CATTAACATA TATACAGAAA CAGTTAT^AAC AACCACAGCA TAAGAGAAAA ACTTCTAGAA 4 7 37 

TGGATATGCT GTATTCATCA GTGTGTTCTT TAAATTATAG GG AAG GCC AAA AAT 4 7 91 

Arg Lys Ala Lys Asn 
180 

CCC COT GGA GAG TCC AGC CTA CAC TGG CCA GCC ATG GCA TTG CCA GCA 4 8 39 
Pro Pro Gly Asp Ser Ser Leu His Trp Pro Ala Met Ala Leu Pro Ala 
185 190 195 

TTG TTT TCT CTT ATA ATT GGC TTT GCT TTT GGA GCC TTA TAG TGG AAG 4 8 87 
Leu Phe Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys 
200 205 210 

GTAAGTGGTA CCATTCCTTT TTTTAAAAAT ATGCTATGTT TACATAAATT ATCATCTTTT 4 94 7 

TTTCCTCAAG AAATGATCCT TTAAGA-AAAC AGTGAATCTA CCTTAGCTTA TACTAAACAA 5007 

AATTTAAATT TTATAAAGTT TCCTGTTTCT CATTATGTCT GGAGACAATC CCTCTAGCTG 50 67 

ATAATTCACG CTTAAGAATT AGGAACTAAA ACTGTTATTG GAGTTATTGC CATAAAAGAT 5127 

AAAAGTGGAG TCCACTTACC TCTTAAATAT TAGACCATTC ATTGATTATT TTACAGTATA 5187 

TGTCTTTCTT CTTTTTCCAG AAG AGA CAG CCA AGT CTT ACA AGG GCA GTT 52 3T 

Lys Arg Gin Pro Ser Leu Thr Arg Ala Val 
215 220 

GAA AAT ATA CAA ATT AAT GAA GAG GAT AAT GAG ATA AG GTATTTTGTT '5285 
Glu Asn lie Gin lie Asn Glu Glu Asp Asn Glu lie Ser 
225 230 235 

TTGCTAAATG TGTGCCCAAT CAAGCATGAC ATTGCCATTT CACACACTGT GTACCTGCCC 534 5 

ATAATGTCTT TAAGAAGTCC TTCACTCATG ACAGTAGCTC CTAACCAGTG AGTCCCAACT 54 05 

CTATCCATGT TTCTGATGTC TCACTCTCTC TTCGTATGTG TATATGCATA TACAGAGAAA 54 65 

GAAATGTTTT AACTACTTGG AAAGACTACC TTAAGACAAA TGAAGTCTTC CCTCTTCCCT 5525 

ATAGTAATAA GAAGGTAGGC TCCCCCATTC AATTTTGCAA TCTTCTGCTA CTATATTTAC 5585 

AGAAAAGCTG CCTTTTAC/UV TGCCGAGATC ATGGTGTACC TCAGAATCTC TGACCAAGAG 5 64 5 

CAAATAAGCA TTTTTTCTTA TTGTTTTTCA G T ATG TTG CAA GAG AAA GAG AGA 5 698 

Met Leu Gin Glu Lys Glu Arg 
240 

GAG TTT CAA GAA GTG TAATTGTGGC TTGTATCAAC ACTGTTACTT TCGTACATTG 57 53 
Glu Phe Gin Glu Val 
245 
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GTAAGTTTTT TTCTTCTTTC CTTTTTTTTT CTTTTTTTTA TTATACTTTA AGTTCTAGGG 5813 
TACATGTGCA CAATGTGCAG GTTTGTTACG TATGTTTACA TGTGCCATGT T 58 64 

(2) INFORMATION FOR SEQ ID NO : 4 8 : 

(i) SEQUENCE CHAEIACTERISTICS : 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
-5 1 5 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lvs 
^0 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arq He Phe 
105 110 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
170 175 180 

Gly Asp Ser Ser Leu His Trp Pro Ala Met Ala Leu Pro Ala Leu Phe 
185 190 195 

Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lvs Aro 
200 205 210 215 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
220 225 230 
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Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
235 240 245 

Val 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
20 25 30 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50. 55 60 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser He lie Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
100 105 110 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
115 120 125 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
130 135 140 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
145 150 155 160 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
195 200 205 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe 
210 215 220 

Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
225 230 235 240 



29 



Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
20 25 30 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val 
100 105 110 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
115 120 125 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
130 135 140 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Ala Val Ala Ser Glu Thr 
145 150 155 160 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Thr 
195 200 205 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Phe Phe 
210 215 220 
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Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
225 230 235 240 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Lys Lys Thr Gin Thr Trp He He Thr Cys He Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Lys Gly He Cys Gly Lys Arg 
20 25 30 

Val Thr Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Lys He Ala Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp He Ser Val Met Val Glu Gin Leu Ser Val Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser He He Asp Lys Leu Val Lys He Val Asp Asp Leu Val 
100 105 • 110 

Glu Cys Thr Glu Gly Tyr Ser Phe Glu Asn Val Lys Lys Ala Pro Lys 
115 120 125 

Ser Pro Glu Leu Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
130 135 140 

Asn Arg Ser He Asp Ala Phe Lys Asp Leu Glu Thr Val Ala Ser Lys 
145 150 155 160 

Ser Ser Glu Cys Val Val Ser Ser Thr Leu Ser Pro Asp Lys Asp Ser 
165 170 175 

Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser 
180 185 190 

Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ser Asn Ser 
195 200 205 
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lie Gly Asp Ser Asn Leu Gin Trp Ala Ala Met Ala Leu Pro Ala Phe 
210 215 220 

Phe Ser Leu Val lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys 
225 230 235 240 

Lys Gin Pro Asn Leu Thr Arg Thr Val Glu Asn lie Gin lie Asn Glu 
245 250 255 

Glu Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin 
260 265 270 

Glu Val 

(2) INFORMATION FOR SEQ ID NO: 52: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Thr Gin Thr Trp lie Val Thr Cys lie Tyr Leu Gin Leu Leu 
15 10 15 

Phe Asn Pro Leu Val Lys Thr Lys Gly Leu Cys Arg Asn Arg Val Thr 
20 25 30 

Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro Lys Asp 
35 40 45 

Tyr Lys lie Ala Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser 
50 55 60 

His Cys Trp lie Ser Val Met Val Glu Gin Leu Ser Val Ser Leu Thr 
65 70 75 80 

Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr 
85 90 95 

Ser lie lie Asp Lys Leu Val Lys lie Val Asp Asp Leu Val Glu Cys 
100 105 110 

Val Glu Gly His Ser Ser Glu Asn Val Lys Lys Ser Ser Lys Ser Pro 
115 120 125 

Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg 
130 135 140 

Ser lie Asp Ala Phe Lys Asp Leu Glu Met Val Ala Ser Lys Thr Ser 
145 150 155 160 

Glu Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg Val 
165 170 175 

Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu 
180 185 190 
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Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Thr Asn Pro lie Glu Asp 
195 200 205 

Ser Ser He Gin Trp Ala Val Met Ala Leu Pro Ala Cys Phe Ser Leu 
210 215 220 

Val He Gly Phe Ala Phe Gly Ala Phe Tyr Trp Lys Lys Lys Gin Pro 
225 230 235 240 

Asn Leu Thr Arg Thr Val Glu Asn He Gin- He Asn Glu Glu Asp Asn 
245 250 255 

Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu Val 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Lys Lys Thr Gin Thr Trp He He Thr Cys He Tyr Leu Gin Leu • 
15 10 15 

Leu Leu Phe Asn Pro Leu Val His Thr Gin Gly He Cys Ser Asn Arq 
20 25 30 

Val Thr Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu- 
50 55 60 

Pro Ser His Cys Trp He Ser Glu Met Val Glu Gin Leu Ser Val Ser 
^5 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Cys He He Asp Lys Leu Val Lys He Val Asp Asp Leu Val 
100 105 110 

Glu Cys Met Glu His Ser Ser Glu Asn Val Lys Lys Ser Ser Lys Ser 
115 120 125 

Pre Glu Pro Arg Gin Phe Thr Pro Glu Lys Phe Phe Gly He Phe Asn 
130 135 140 

Lys Ser He Asp Ala Phe Lys Asp Leu Glu He Val Ala Ser Lys Met 
145 150 155 160 

Ser Glu Cys Val He Ser Ser Thr Ser Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 



33 



Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ser Asn Ser lie 
195 200 205 

Glu Asp Ser Ser Leu Gin Trp Ala Ala Val Ala Leu Pro Ala Phe Phe 
210 215 220 

Ser Leu Val lie Gly Phe Ala Phe Gly Ala Phe Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Pro Asn Leu Thr Arg Thr Val Glu Asn Arg Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

. Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
- 1 5 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gin Glu lie Cys Arg Asn Pro 
20 25 30 

Val Thr Asp Asn Val Lys Asp lie Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Asn Asp Tyr Met lie Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp Leu Arg Asp Met Val Thr His Leu Ser Val Ser 
65 70 75 80 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser lie lie Asp Lys Leu Gly Lys lie Val Asp Asp Leu Val 
100 105 110 

Ala Cys Met Glu Glu Asn Ala Pro Leu Asn Val Lys Glu Ser Leu Lys 
115 120 125 

Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe Ser lie Phe 
130 135 140 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
145 150 155 160 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
165 170 175 
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Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ser Pro 
195 200 205 

Glu Asp Pro Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu lie 
210 215 220 

Ser Leu Val lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
245 250 255 

Asp Asn Glu He Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Lys Lys Thr Gin Thr Trp He He Thr Cys He Tyr Leu Glfi Leu 
1 5 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Lys Glu He Cys Gly Asn Pro' 
20 25 30 

Val Thr Asp Asn Val Lys Asp He Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Asn Asp Tyr Met He Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp Leu Arg Asp Met Val He Gin Leu Ser Leu Ser 

70 75 80 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser He He Asp Lys Leu Gly Lys He Val Asp Asp Leu Val 
100 105 110 

Leu Cys Met Glu Glu Asn Ala Pro Lys Asn He Lys Glu Ser Pro Lys 
115 120 125 

Arg Pro Glu Thr Arg Ser Phe Thr Pro Glu Glu Phe Phe Ser He Phe 
130 135 140 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
1« 150 155 160 
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Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ala Pro 
195 200 205 

Glu Asp Ser Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu He 
210 215 220 

Ser Leu Val He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
245 250 255 

Asp Asn Glu He Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

Thr Trp He He Thr Cys Phe Cys Leu Gin Leu Leu Leu Leu Asn Pro 
15 10 15 

Leu Val Lys Ala Gin Ser Ser Cys Gly Asn Pro Val Thr Asp Asp Val 
20 25 30 

Asn Asp lie Ala Lys Leu Val Gly Asn Leu Pro Asn Asp Tyr Leu He 
35 40 4S 

Thr Leu Lys Tyr Val Pro Lys Met Asp Ser Leu Pro Asn His Cys Trp 
50 55 60 

Leu His Leu Met Val Pro Glu Phe Ser Arg Ser Leu His Asn Leu Leu 
^5 70 75 80 

Gin Lys Phe Ser Asp He Ser Asp Met Ser Asp Val Leu Ser Asn Tyr 
85 90 95 

Ser He He Asn Asn Leu Thr Arg He He Asn Asp Leu Met Ala Cys 
100 105 110 

Leu Ala Phe Asp Lys Asn Lys Asp Phe He Lys Glu Asn Gly Leu His 
115 120 125 

Tyr Glu Glu Asp Arg Phe He Pro Glu Asn Phe Phe Arg Leu Phe Asn 
130 135 140 
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Ser Thr lie Glu Val Tyr Lys Glu Phe Ala Asp Ser Leu Asp Lys Asn 
145 150 155 160 

Asp Cys lie Met Pro Ser Thr Val Glu Thr Pro Glu Asn Asp Ser Arg 
165 170 175 

Val Ala Val Thr Lys Thr lie Ser Phe Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser lie Gly Ser Asn Thr Ser Ser Asn Ser Asn Lys 
195 200 205 

Glu Ala Leu Gly Phe lie Ser Ser Ser Ser. Leu Gin Gly lie Ser lie 
210 215 220 

Ala Leu Thr Ser Leu Leu Ser Leu Leu lie Gly Phe lie Leu Gly Ala 
' 225 230 235 240 

lie Tyr Trp Lys Lys Thr His Pro Lys Ser Arg Pro Glu Ser Asn Glu 
245 250 255 

Thr lie Gin Cys His Gly Cys Gin Glu Glu Asn Glu lie Ser Met Leu 
260 265 270 

Gin Gin Lys Glu Lys Glu His Leu Gin Val 
275 280 

(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gly lie Cys Arg Asn Arg Val 
20 25 30 

Thr Asp Val Lys Asp Val Thr Lys Leu Val Ala' Asn Leu Pro Lys Asp 
35 40 45 

Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser 
50 55 60 

His Cys Trp lie Ser Glu Met Val Glu Gin Leu Ser Val Ser Leu Thr 
65 70 75 80 

Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr 
85 90 95 

Ser lie lie Asp Lys Leu Val Lys lie Val Asp Asp Leu Val Glu Cys 
100 105 110 

Glu Glu Asn Ser Ser Lys Asn Val Lys Lys Ser Lys Ser Pro Glu Pro 
115 120 125 
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Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie 
130 135 140 

Asp Ala Phe Lys Asp Phe Met Val Ala Ser Lys Thr Ser Asp Cys Val 
145 150 155 160 

Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg Val Ser Val Thr 
165 170 175 

Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp 
180 185 ■ 190 

Ser Ser Ser Ser Asn Arg Lys Ala Asn Glu Asp Ser Ser Leu Gin Trp 
195 200 205 

Ala Ala Met Ala Leu Pro Ala Leu Phe Ser Leu Val lie Gly Phe Ala 
, 210 215 220 

Phe Gly Ala Leu Tyr Trp Lys Lys Lys Gin Pro Ser Leu Thr Arg Ala 
225 230 235 240 

Val Glu Asn lie Gin lie Asn Glu Glu Asp Asn Glu lie Ser Met Leu 
245 250 255 

Gin Glu Lys Glii Arg Glu Phe Gin Glu Val 
260 265 

(2) INFORMATION FOR , SEQ ID NO:58: 

(1) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 269 base pairs 

(B) TYPE:, nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin ( 1 . . 2 10 223.. 258) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GAA TTC TTC CGT ATC TTC AAC CGT TCC ATC GAC GCT TTC AAA GAC TTC 4 8 

Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe 

1 5- 10 15 • 

GTT GTT GCT TCC GAA ACC TCC GAC TGC GTT GTT TCC TCC ACC CTG TCT 96 
Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser 
20 25 30 

CCG GAA AAA GAC TCC CGT GTT TCG GTT ACC AAA CCG TTC ATG CTG CCG 14 4 

Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro 
35 40 45 

CCG GTT GCT GCT TCC TCC CTG CGT AAC GAC TCC TCC TCC TCC AAC TCC 192 
Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser 
50 55 60 

AAA TAC ATC TAC CTG ATC TAATAGGATC CG GTT ACC AAA CCG TTC ATG 24 0 

Lys Tyr lie Tyr Leu lie Val Thr Lys Pro Phe Met 

65 70 75 
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CTG CCG CCG GTT GCT GOT TAATAGGATC C 2 69 

Leu Pro Pro Val Ala Ala 
80 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Glu Phe Phe Arg lie Phe Asn Arg Ser.Ile Asp Ala Phe Lys Asp Phe 
15 10 15 

Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser 
20 25 30 

Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro 
35 40 45 

Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser 
50 55 60 

Lys Tyr He Tyr Leu He Val Thr Lys Pro Phe Met Leu Pro Pro Val 
65 70 75 80 

Ala Ala 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 184.. 1002 

(ix) FEATURE: 

(A) NAME/KEY: mat_ peptide 

(B) LOCATION: 259.'". 1002 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

CCGCCTCGCG CCGAGACTAG AAGCGCTGCG GG^^AGCAGGG ACAGTGGAGA GGGCGCTGCG 60 

CTCGGGCTAC CCAATGCGTG GACTATCTGC CGCCGCTGTT CGTGCAATAT GCTGGAGCTC 120 

CAGAACAGCT AAACGGAGTC GCCACACCAC TGTTTGTGCT GGATCGCAGC GCTGCCTTTC 180 

CTT ATG AAG AAG ACA CAA ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG 22 8 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin 
-25 -20 -15 
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CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT 27 6 

Leu Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arq Asn 
-10 -5 15 

CGT GTG ACT AAT AAT GTA A7^ GAC GTC ACT AAA TTG GTG GCA AAT CTT 32 4 

Arg Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu 
10 15 20 

CCA AAA GAC TAG ATG ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT 372 
Pro Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val 
25 30 35 

TTG CCA AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC 4 20 

Leu Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp 
40 45 50 

AGC TTG ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG 4 68 

Ser Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu 
55 60 65 70 

AGT AAT TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT 516 
Ser Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu 
75 80 85 

GTG GAG TGC GTG AAA GAA AAC TCA TCT AAG GAT CTA AAA AAA TCA TTC 5 64 

Val Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe 
90 95 100 

AAG AGC CCA GAA CCC AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT 612 
Lys Ser Pro Glu Fro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He 
105. 110 115 

TTT AAT AGA TCC ATT GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA 660 
Phe Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu 
120 125 130 

ACT AGT GAT TGT GTG GTT TCT TCA ACA TTA AGT CCT GAG AAA GAT TCC '";708 
Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser - 
135 140 145 150 

AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA CCC CCT GTT GCA GCC AGC 7 56 

Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser 
155 160 165 

TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGG AAG GCC AAA AAT CCC 804 
Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro 
170 175 180 

CCT GGA GAC TCC AGC CTA CAC TGG GCA GCC ATG GCA TTG CCA GCA TTG 8 52 

Pro Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu 
185 .190 195 

TTT TCT CTT ATA ATT GGC TTT GCT TTT GGA GCC TTA TAC TGG AAG AAG 900 
Phe Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys 
200 205 210 

AGA CAG CCA AGT CTT ACA AGG GCA GTT GAA AAT ATA CAA ATT AAT GAA 94 8 

Arg Gin Pro Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu 
215 220 225 230 

GAG GAT AAT GAG ATA AGT ATG TTG CAA GAG AAA GAG AGA GAG TTT CAA 996 
Glu Asp Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin 
235 240 245 
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GAA GTG TAATTGTGGC TTGTATCAAC ACTGTTACTT TCGTACATTG GCTGGTT^ACA 1052 
Glu Val 

GTTCATGTTT GCTTCATAAA TGAAGCAGCT TTAAACAAAT TCATATTCTG TCTGGAGTGA 1112 

CAGACCACAT CTTTATCTGT TCTTGCTACC CATGACTTTA TATGGATGAT TCAGAAATTG 1172 

GAACAGAATG TTTTACTGTG AAACTGGCAC TGAATTAATC ATCTATAAAG AAGAACTTGC 12 32 

ATGGAGCAGG ACTCTATTTT AAGGACTGCG GGACTTGGGT CTCATTTAGA ACTTGCAGCT 12 92 

GATGTTGGAA GAGAAAGCAC GTGTCTCAGA CTGCATGTAC CATTTGCATG GCTCCAGAAA 1352 

TGTCTAAATG CTGAAAAAAC ACCTAGCTTT ATTCTTCAGA TACAAACTGC AG 14 04 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu 
-2-5 -. -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg A.sn Arg 
-5 15 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 , 30 35 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
105 110 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 
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Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
170 175 180 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe 
185 190 195 

Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
200 205 210 215 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
220 225 230 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
235 240 245 

Val 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

' (ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 151.. 885 

(ix) FEATURE: 

(A) NAME/KEY: mat_ peptide 

(B) LOCATION: 226.^885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 62: 

AGCAGGGACA GTGGAGAGGG CGCTGCGCTC GGGCTACCCA ATGCGTGGAC TATCTGCCGC ' " 60 

CGCTGTTCGT GCAATATGCT GGAGCTCCAG AACAGCTAAA CGGAGTCGCC ACACCACTGT 120 

TTGTGCTGGA TCGCAGCGCT GCCTTTCCTT ATG AAG AAG ACA CAA ACT TGG ATT 174 

Met Lys Lys Thr Gin Thr Trp lie 
-25 -20 

CTC ACT TGC ATT TAT CTT CAG CTG CTC CTA TTT AAT CCT CTC GTC AAA 222 
Leu Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys 
-15 -10 -5 

ACT GAA GGG ATC TGC AGG AAT CGT GTG ACT AAT AAT GTA AAA GAC GTC 270 
Thr Glu Gly lie Cys Arg Asn Arg Val Thr Asn Asn Val Lys Asp Val 
1 5 10 15 

ACT AAA TTG GTG GCA AAT CTT CCA AAA GAC TAC ATG ATA ACC CTC AAA 318 
Thr Lys Leu Val Ala Asn Leu Pro Lys Asp Tyr Met lie Thr Leu Lys 
20 25 30 

TAT GTC CCC GGG ATG GAT GTT TTG CCA AGT CAT TGT TGG ATA AGC GAG 3 66 

Tyr Val Pro Gly Met Asp Val Leu Pro Ser His Cys Trp lie Ser Glu 
35 40 45 
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ATG GTA GTA CAA TTG TCA GAG AGC TTG ACT GAT CTT CTG GAG AAG TTT 414 
Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu Asp Lys Phe 
50 55 60 

TCA AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAG AAA CTT 4 62 

Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie Asp Lys Leu 
65 70 75 

GTG AAT ATA GTG GAT GAC CTT GTG GAG TGC GTG AAA GAA AAC TCA TCT 510 
Val Asn lie Val Asp Asp Leu Val Glu Cys Val Lys Glu Asn Ser Ser 
80 85 90 95 

AAG GAT GTA AAA AAA TCA TTC AAG AGC CCA GAA CCC AGG CTG TTT ACT 558 
Lys Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg Leu Phe Thr 
100 105 110 

CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC ATT GAT GCC TTC AAG 60 6 

Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys 
115 120 125 

GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT GTG GTT TCT TCA ACA 65 4 

Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr 
130 135 140 

TTA AGT CCT GAG AAA GGG AAG GCC AAA AAT CCC CCT GGA GAC TCC AGC 7 02 

Leu Ser Pro Glu Lys Gly Lys Ala Lys Asn Pro Pro Gly Asp Ser Ser 
145 150 155 

CTA CAC TGG GCA GCC ATG GCA TTG CCA ■ GCA TTG TTT TCT CTT ATA ATT .750 
Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe Ser Leu lie He 
160 165 170 175 

GGC TTT GCT TTT GGA GCC TTA TAG TGG AAG AAG AGA CAG CCA AGT CTT 798 
Gly Phe Ala Phe Gly Ala Leu. Tyr Trp Lys Lvs Arg Gin Pro Ser Leu 
180 185 190 

ACA AGG GCA GTT GAA AAT ATA CAA ATT AAT GAA GAG GAT AAT GAG ATA • 84 6 

Thr Arg Ala Val Glu Asn He' Gin He Asn Glu Glu Asp Asn Glu He 
195 200 205 

AGT ATG TTG CAA GAG AAA GAG AGA GAG TTT CAA GAA GTG TAATTGTGGC 8 95 

Ser Met Leu' Gin Glu Lys Glu Arg Glu Phe Gin Glu Val 
210 215 220 

TTGTATCAAC ACTGTTACTT TCGTACATTG GCTGGTAACA GTTCATGTTT GCTTCATAAA . 955 

TGAJVGCAGCT TTAAACAAAT TCATATTCTG TCTGGAGTGA CAGACCACAT CTTTATCTGT 1015 

TCTTGCTACC CATGACTTTA TATGGATGAT TCAGAAATTG GAACAGAATG TTTTACTGTG 1075 

AAACTGGCAC TGA 1088 

(2) INFORMATION FOR SEQ ID NO: 63:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
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Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu 
-25 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
-5 15 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 "65 .70 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
105 110 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
12C' 125 130 155 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Gly Lys Ala 
140 145 150 

Lys. Asn Pro Pro Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu 
155 160 165 

Pro- Ala Leu Phe Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr 
17Q 175 180 

Trp Lys Lys Arg Gin Pro Ser Leu Thr Arg Ala Val Glu Asn He Gin 
■ 185 190 195 

He Asn Glu Glu Asp Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg 
200 205 210 215 

Glu Phe Gin Glu Val 
220 

(2) INFORMATION FOR SEQ ID NO: 64 : 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Glu Glu He Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp He Thr 
15 10 15 
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Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr 
20 25 30 

Val Ala Gly Met Asp Val Leu Pro Ser His Xaa Trp Leu Arg Asp 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

lie Thr Thr Leu Asn Tyr Val Ala Gly Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:. 

Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Xaa Xaa Leu Gly Pro :* 
15 10 .15 

• Glu Lys Asp Ser Arg Val Ser Val Xaa Lys. 

20 25 

(2). INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
: (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67; 

Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
1 5 ■ 10 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE:, amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Glu Glu Asn Ala Pro Lys Asn Val Glu Ser Leu Lys Lys Pro Thr Arg 
15 .10 15 

Asn Phe Thr Pro Glu Glu Phe Phe Ser lie Phe Asp Arg Ser lie Asp 
■ 20 25 30 

Ala 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Glu Ser Leu Lys Lys Pro Glu Thr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 5 amino acids 
(3) TYPE: amine acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii), MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Val Ser Val Xaa Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO : 7 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

lie Val Asp Asp Leu Val Ala Ala Met Glu Glu Asn Ala Pro Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Asn Phe Thr Pro Glu Glu Phe Phe Ser lie Phe Xaa Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr Val 
1 5 10 - 15 

Ala Gly Asp Asp Val Leu Pro Ser His Cys Trp Leu Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 amino acids . 
(3) TYPE: :amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr Ser Asp 
1 5 10 15 

Cys Val Leu Ser Xaa Xaa Leu Gly- 
20 

(2) INFORMATION FOR. SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Glu Ser Leu Lys Lys Pro Glu Thr, Arg Asn Phe Thr Pro Glu Glu Phe 
15 10 15 

Phe Ser He Phe Xaa Arg 
20 

(2) INFORMATION FOR SEQ ID NO : 7 6 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 6 : 

Giu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe 
1 5 . . 10 15 

Phe Ser lie Phe Asp Arg 
20 

(2) INFORbdATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Asn Ala Pro Lys Asn Val Lys Glu 
1 5 . 

(2) INFORMATION FOR. SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: . 
(A) LENGTH: 16 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ser Arg Val Ser Val Xaa Lys Pro^ Phe Met Leu Pro Pro Val Ala Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO : 7 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe 
15 10 15 

Ser He Phe Asn Arg Ser He Asp- Ala Phe Lys Asp Phe Met Val Ala 
20 25 30 
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Ser Asp 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids: 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe 
1 5 ,10 15 

Ser lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Met. Val Ala 
20. 25 30 

Ser Asp Thr Ser Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 amino acids 

(B) TYPE:. amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Leu Arg Asp Met Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu 
1 5 10 15 

Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie 
20 25 30 

Asp Lys Leu Gly Lys lie Val Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 83: 
(i) SEQUENCE CHARACTERISTICS: 
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{A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
Pro Val Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CxHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 
CCTGAGAAAG ATTCCAGAGTC 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs . 

(B) TYPE:, nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 
CTGCAGTTTG TATCTGAAG 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
CATATAAAGT CATGGGTAG 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
ACTTGTGTCT TCTTCATTUVG GAAAGGC 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
TGTACGAAAG TAACAGTGTT G 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
ACTGCTCCTA TTTAATCCTC TG 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
CACTGACTCT GGAATCTTTC TCA 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
TCGACCCGGA TCCCC 

(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
TCGAGGGGAT CCGGG 15 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TCTTCTTCAT GGCGGCGGCA AGCTT 25 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids ■ 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

Asp Ser Arg Val Ser Val Xaa Lys Pro Phe Phe Met Leu Pro Pro Val" 
1 5 ,10 15 

Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Asp Ser Arg Val Ser Val Thr Lys Pro Phe Phe Met Leu Pro Pro Val 
1 5 .10 15 

Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 96: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
CGATTTGATT CTAGAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GGTAC 55 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID-NO:97: 
CGAATTCCAA CGCGTTAACC ATATGTTATT CCTCCTTCTA GAATCAAAT 4 9 

(2) INFORMATION FOR SEQ ID NO: 98: - 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base. pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
TATGCAGGA " 9 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GATCTCCTGC A 11 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



TATGGAAGGT ATCTGCA 



17 



(2) INFORMATION FOR SEQ ID NO: 101: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE:, nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



GATACCTTCC A 
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(2) INFORMATION FOR SEQ ID NO: 102: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE:' nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

TTTCCTTATG 1'^ 

(2) INFORMATION FOB SEQ ID NO: 103: . 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 
GCCGCCGCCA TG 12 
(2) INFORMATION FOR SEQ ID NO: 104 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 
TCTTCTTCAT GGCGGCGGCA AGCTT 2 5 
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